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Abstract — Unitary gates are interesting resources for 
quantum communication in part because they are always 
invertible and are intrinsically bidirectional. This paper 
explores these two symmetries: time-reversal and exchange 
of Alice and Bob. We will present examples of unitary gates 
that exhibit dramatic separations between forward and 
backward capacities (even when the back communication is 
assisted by free entanglement) and between entanglement- 
assisted and unassisted capacities, among many others. 
Along the way, we will give a general time-reversal rule for 
relating the capacities of a unitary gate and its inverse that 
will explain why previous attempts at finding asymmetric 
capacities failed. Finally, we will see how the ability to erase 
quantum information and destroy entanglement can be a 
valuable resource for quantum communication. 

I. Introduction: Communication using 

BIPARTITE UNITARY GATES 

THIS paper investigates the asymptotic communica- 
tion capacities of bipartite unitary quantum gates; 
for example, a CNOT gate with control qubit held 
by Alice and target qubit held by Bob. For a review 
of this topic see ||7]|23] and references therein. The 
question of unitary gate capacity arises when studying 
our ability to communicate or generate entanglement us- 
ing naturally occurring physical interactions; moreover, 
studying unitary gate capacity has often led to new ideas 
that are useful for other topics in quantum information 
theory [22 1 . 

In some ways unitary gates are like classical bidi- 
rectional channels or noisy quantum channels, but they 
are both more complex than one-way channels because 
of their intrinsic bidirectionality, and simpler than noisy 
channels because they involve no interactions with the 
environment. For example, any nonlocal unitary gate has 
nonzero capacities to send classical messages in either 
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direction and to create entanglement f3', Til. By contrast, 
bidirectional classical channels exist that have no ca- 
pacity in either direction, but can be useful for nonlocal 
tasks like reducing communication complexity! 15 1. Even 
deterministic classical bidirectional channels, like the 
classical CNOT, can have capacities that are nonzero 
only in one direction. Another feature of unitary gates 
is that, unlike noisy quantum channels, knowing the 
classical capacity (as a function of the amount of entan- 
glement assistance) of a unitary gate also determines its 
quantum capacity (again parameterized by the amount 
of entanglement assistance). Moreover, allowing free 
classical communication does not improve the entangle- 
ment capacity; on the other hand, the quantum capacity 
appears to no longer be simply equal to the entanglement 
generating capacity. In short, the usual questions (like 
additivity) about noisy channel capacities are replaced 
by an intriguingly different, yet perhaps related, set of 
questions about unitary gate capacities. 

In this paper we will investigate the questions of 
symmetry, both time-reversal and exchange of Alice and 
Bob, that arise in connection with unitary gate capacities. 
We will demonstrate 

• A general rule for relating capacity regions of a 
gate U to those of its inverse (Sec. |ll]i. Along 
the way, we recast the main result of 1251 as a sort 
of structure theorem for communication protocols 
based on unitary gates, which leads us to propose 
a new way to view the capacity region of unitary 
gates. 

• A gate that exhibits nearly the strongest possible 
separation between forward and backward capac- 
ities, even when free entanglement is allowed for 
back communication (Sec. Ullb . 

• A gate with a nearly maximal separation between its 
ability to create and to destroy entanglement. This 
gate also exhibits a near-maximal improvement in 
communication capacity when assisted by entangle- 
ment (Sec. lIVI i. (A variant of the former result was 
independently proved for a different gate in ||38l .) 

• A quantum communication resource, "coherent era- 



sure," that can be thought of as the time-reversal of 
coherent classical communication (Sec. |V]l. 

• A more restricted type of resource inequality, which 
we call a "clean resource inequality." (described in 
more detail below) 

• Alternate proofs for the two main unitary gate 
capacity theorems that are currently known (the 
Appendix). The proofs are simpler and establish 
slightly stronger versions of the capacity theorems; 
we also include them because they make this paper 
a self-contained summary of almost every result to 
date on asymptotic unitary gate capacities. 

As we will see, a number of speculative claims about 
unitary gate capacities remain to be fully resolved, and 
more interestingly, we are only beginning to pose our 
questions about them in the right way. We conclude in 
Sec. |Vl] with some ideas about future research. 

The remainder of this section reviews notation and 
some background results. Following lfT9ll23l we state 
our coding theorems in the language of asymptotic 
resource inequalities. The basic asymptotic resources 
are [c c] (one use of a noiseless classical chan- 
nel from Alice to Bob, a.k.a. a cbit), [qq] (the state 
1*'*'*^ = 75 Ex=o |2^>^l^>^' ^-k-a- anebit)and [q ^ q] 
(one use of a noiseless quantum channel, a.k.a. a qubit). 
Protocols transforming these resources into one another 
(e.g. teleportation) are expressed as asymptotic resource 
inequalities such as 2[c c] + [qq] > [q q]. We 
will also make use of coherent bits, or cobits, which 
are denoted [q — > qq] and correspond to the isometry 
12x=o\^)'^\^)^ {^\^ ■ Coherent bits were introduced in 
II22I which proved that 2[q ^ qq] = g] + [qq] 

(though only as an asymptotic relation; see II2TI for a 
single-shot version). Since we are interested in two-way 
communication, define [c ^ c], <— q] and [qq q] to 
be cbits, qubits, and cobits, respectively, sent from Bob 
to Alice. These definitions are summarized in Sec. IVIII 

For a unitary gate U, let (U) denote the corresponding 
asymptotic resource; we can use it to state resource 
inequalities such as (CNOT) > [c ^ c]. Define 
CCE([/) to be the three-dimensional capacity region 
of U to send cbits forward, send cbits backwards and 
generate entanglement: 

CCE(C/) -.= {{01,02, E) : 

{U)>0i[c^c] + 02[c^c] + E[qq]}. (1) 

For example, if we define SWAP to exchange a qubit 
of Alice's with a qubit of Bob's, then (1,1,0) G 
CCE(SWAP), since one use of SWAPcan be used to 
send one bit forward and another bit backwards at 
the same time. When Oi,02,E take on negative val- 



ues, we move the corresponding resources to the left- 
hand side of the resource inequality, e.g. E < for 
entanglement-assisted communication. Continuing with 
the SWAPexample, we could use super-dense coding to 
consume 2 ebits and send 2 cbits in either direction: thus 
(2, 2, -2) e CCE(SWAP). We can define capacities in 
terms of CCE([/) as extremal points of the region: the 
entanglement capacity E{U) := max{E : (0, 0, E) e 
CCE(C/)}, the forward classical capacity 0^{U) := 
max{C : (C, 0, 0) G CCE(;7)}, the backwards ca- 
pacity 0^{U) := max{C : (0,C,0) G CCE(;7)}, 
the simultaneous capacity 0+{U) := max{Ci + O2 ■ 
(Ci, C2, 0) e CCE(J7)}, and entanglement-assisted ver- 
sions O'iiU) := max{C : (C,0,-oo) e CCE(?7)}, 
OtiU) := max{C : (0, C, -c5o) e CCE(C/)} and 
Of{U) := max{Ci + O2 : (Ci,C2,oo) e CCE{U)}. 
In [3, 7 1, it was shown that one of these capacities is 
nonzero if and only if all of them are nonzero. Various 
quantitative relations among these capacities were also 
shown, but they will be subsumed in what follows. 

We can analogously define the capacity region of 
achievable rates of entanglement generation and coherent 
communication in both directions. In fl5\ this region was 
shown to coincide with CCE(;7) for the Oi,02 > 
quadrant, and to be trivially related for other quadrants. 
Here we will present this result in a slightly stronger 
form. If a = (q!„)^i and (3 = {(3n)'^=i are two /7Mr41] 
asymptotic resources such that a > (3, then we say that 

clean 

this resource inequality is clean (denoted a > /3) if 
a„ can be mapped to [3^(1 -s^) using a protocol that 
discards only nS'^^ qubits, which (up to error e„) are all 
in the state |0), where e„, (S„, (5^ ^ as rt ^ ooQ We 
will also call protocols "semi-clean" when, at the end 
of the protocol, they discard arbitrary nJ^j-qubit states 
which depend only on n and not on any other inputs or 
outputs of the protocol. In most cases of interest, semi- 
clean protocols are also clean. 

Implicit in the definition of a clean protocol is the 
idea that all local quantum operations are represented by 
isometrics (perhaps increasing the dimension) followed 
by discarding some qubits. The advantage of this formu- 
lation is that apart from the discarding step, protocols 
can be easily reversed. On the other hand, requiring 
that resources be pure is quite a restrictive condition, 
and hopefully future work will able to fruitfully relax 
it. Many common resource inequalities, such as entan- 

' Following [191 we say that a resource is pure if it is an isometry 
or a pure state. 

^The idea that discarding information should be costly dates back 
to Szilard's interpretation in 1929 of Maxwell's demon|45| (see also 
and in the context of quantum Shannon theory has been discussed 
in EI]. 
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glement concentration/dilution, remote state preparation, 
channel coding, etc., can be shown to admit "clean" ver- 
sions, but other simple inequalities such as 2[qq\ > [qq\ 
do not. Now define 

clCCE(C/) {{Ci.,C2,E) : 

{U) > Ci[q^qq]+C2[qq^q] + E[qq]}. (2) 



Refs II23II25II considered the similar region CoCoE(C/) 
in which there was no requirement that the protocols be 
clean. Although clCCE(C/) is still convex, it is no longer 
monotone in the sense that throwing away resources 
does not always yield valid protocols; for example, while 
points like (0, 0, — oo) are in CCE(C/), one can show 
that E{W) = max{£; : (0,0,-£;) G clCCE(C/)}. This 
result can be proven directly using the formula for E{U) 
in IItI IJTI . and will also follow from a more general 
theorem relating clCCE(C/) to c\CCE{W) that we prove 
in Sec. M 

The main use of clCCE in this paper will be the 
following strengthening of ESl 's main result. 

Theorem 1 (std. form of unitary protocols): If 

([/) > [c ^ c] + C2 [c ^ c] + E[qq] 

and Ci , 6*2 > 0, then there exists E' > E such that 

{U) > Ci[q^qq] + C2[qq^q]+E'[qq]. 
In other words, there exists a series of protocols 

(K)^^Li, with 

(3) 

for local isometrics Ag"' , Sq""* , . . . , A^n ^ , iJ^"' (pos- 
sibly adding ancillas), such that for all x G 

{0, l}"(Ci-<5..)^2^ £ {0, i}»(C2-5„)^ we have 



«(£-<5.)(|oo)^^)«"". (4) 
as rt ^ CX3 and p k, a means that 



Here e„,(5„,(5^j 
cr||l < £■ 

In terms of CCE and clCCE, this means that for 
any {Ci,C2,E) e CCE there exists E' > E such that 
{Ci,C2,E') e clCCE. 

Ref. 1 23 1 sketched how to extend the proof of ||25l to 
obtain the above theorem, but we will make use of ll24l 
to give a more rigorous derivation in the Appendix. 

Finally, we state a single-shot expression for the trade- 
off curve between ebits and cbits sent from Alice to 
BobEllEl; call this tradeoff curve CE(C/) and define 
it to be CE(t/) ~ {{C,E) : {C,0,E) G CCE(t/)}. 



Similarly define clCE([/) := {{C,E) : {C,0,E) e 
clCCE([/)}. Before we can state our expression for 
clCE([/), we will need a few more definitions (following 
1 191). For a state ^-^sc ^ ^^^^^abc ^ ^.^^^jj 

tion of the von Neumann entropy as H{A) = H{A)^ = 
H{iP^) = -Tr(V'^logV'^), where = Trsc V'^^^- 
Similarly the quantum mutual information is I{A\ B) = 
H{A) + H{B)-H{AB) and the conditional information 
is defined as H{B\A) = H{AB) - H{A). Here, and 
elsewhere, we omit subscripts when the underlying state 
is obvious. We will denote an ensemble of pure states by 
^XABA'B' ^ J2.P^\^){^f ® Here 
X is a classical label, the gate U acts on AB, and 
A'B' are ancilla systems of arbitrary finite dimension. 
Let U{£) stand for (J7^^ (g) In terms of 

these ensembles we can define 

AiMU) ■.= {{C,E) -.BS s.t. 

I{X;BB%(^£^^I{X;BB')£>C 
and H{BB'\X)ui£) - H{BB'\X)£ = E} , (5) 

where £ is an ensemble of bipartite pure states in 
ABA' B' conditioned on a classical register X. This 
corresponds to the set of single-shot increases in mu- 
tual information {I{X; BB')) and average entanglement 
iH{BB'\X)) that are possible. It turns out that these in- 
creases are also achievable asymptotically, as expressed 
in the following theorem: 

Theorem 2: clCE([/) is equal to the closure of 

The direct coding theorem was proven in ll22l and the 
converse in |(23l Section 3.4.2]. In the Appendix we will 
give a new, and more self-contained, proof of the coding 
theorem. 

II. Reversing unitary communication 

PROTOCOLS 

In this section, we present a general theorem for 
relating the capacity region of U with the capacity region 
of C/^ . Many of the key ideas are illustrated by the gate 
C^xoxo, which was conjectured in fT\ to have asymmetric 
communication capacities. Uxoxo acts on a 2™ x 2™- 
dimensional space, with m a parameter, and is defined 



C^xoxo|a;0) = \xx) 
t^xoxo|a:a;) = |a;0) 
Uxoxo\xy) = la;?;) 



V2:G {0,1}" 
Vx e {0, 1}™ 



Clearly (Uxoxo) > "n^lq — > qq], but at first glance it 
appears that Uxoxo cannot easily be used for communi- 
cation from Bob to Alice. Indeed 1231 proved that when 
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starting without correlation or entanglement, a single use 
of Uxoxo could not send more than O.Trn + o(m) bits 
from Bob to Alice. 

However, by consuming entanglement, Uxoxo can be 
used to send rn bits from Bob to Alice. The protocol is 
as follows: 

(1) Start with 2-™/2X;^g2;" l^>'^l^)^- 

(2) To encode message bi, . . . , bm, 
. . Z'^'j^ and obtains the 

applied to yield the 



applies Z-^ 



(3) 



Uxoxo is 

2-m/2 



Bob 
state 

state 



.(_l)b-|f)'^|0)^. 

(4) Alice applies 77**™ and obtains 
Thus (Uxoxo) + mlqq] > m[qq ^ q 



) \or. 

As a corollary. 



C^{Uxoxo) > m/2. If we could prove that this were 
roughly tight (say that C^{Uxoxo) < rn/2 + o{m)), 
then we might conclude that forward and backward 
capacities can be separated by a constant factor, but that 
this separation vanishes when entanglement is allowed 
for free. We will later demonstrate unitary gates with 
much stronger separations, even between entanglement- 
assisted capacities. 

First, we can generalize the backward communication 
protocol of Uxoxo to obtain the following result: 

Theorems: {Ci,C2,E) e clCCE([/) ^ 
(C2, Ci, -E-Ci- C2) G clCCE([/t) 

Proof: The proof follows almost immediately from 
Theorem[T] Suppose {Ci,C2,E) e clCCE([/), so that 
for any 6,e > and all n sufficiently large there exists 
a protocol Vn of the form of Eq. (O that satisfies 
Eq. (|4|i. We can assume WLOG that the local isometrics 
in Eq. (O are in fact unitaries, with all the ancillas 
being added at the beginning. Then we take the complex 
conjugate of Eq. ^ to obtain 

vl = (4") ® i?("Vc/t . . . c/t(4") ^ B^^y)^. (6) 

Observe that is a protocol that uses W n times 
together with with local resources. AUce and Bob have 
only to create the nS'^ copies of |00), which they 
can do using local isometrics for free. Then they can 
use W n times to apply 7-"^. Up to error e„, this 
maps \x)^\x)^ Ix)-^ for x S {0, and 
\y)^\y)^ \y)^ for y e {0, while con- 
suming n{E+Sn) ebits (or generating ~n{E+Sn) ebits). 
By applying the protocol outlined above for Uxoxo, this 
can be used to send n{C2 — Sn) cobits from Alice to 
Bob and n(Ci — (5„) cobits from Bob to Alice, while 
consuming n{Ci + C2 + E — (5„) ebits (or generating 
-n{Ci + C2 + E - Sn) ebits). ■ 
From the definition of entanglement capacity we now 
obtain a statement claimed in the last section. 



Corollary 1: For any unitary U, ElU^) = 
inax{E : (0,0,-^;) e clCCE(C/)}. 

We can also obtain a few immediate corollaries for 
the case of free entanglement. 

Corollary 2: For any unitary U, 

(1) (Ci,C2,-oo) e CCE(;7) ^ (C2,Ci,-oo)e 
CCE(t/t). 

(2) In particular, C5(t/) = C^{W) 

(3) IfU^W then C5([/) = C5([/). 

(4) IfU^W then cSiU) > C£(C/)/2. 

The only nontrivial claim here is (4). To prove it, 
first note that the entanglement assisted capacity can 
be achieved with the assistance of < E{U^) ebits. 
This is because {C,0,-E) e clCCE(J7) implies 
(0, 0, —E) e clCCE(C/) (since cobits can always be dis- 
carded cleanly) and Theorem[3]implies that E{U^) > E. 
Since we have assumed U = W, we have E{U) = 
E{U^) > E. Thus two uses of U can send C cobits: the 
first use generates E ebits and the second use consumes 
them to send C cobits. (A similar result was proved in 
11121.) 

Note that cases (3) and (4) of Corollary |2] apply 
to Uxoxo, and show us why we should not expect a 
dramatic separation of capacity for Uxoxo, or indeed 
any gate equal to its inverse. However, a straightforward 
modification of the argument in the next section can 
be used to prove [T|'s conjecture that C^{Uxoxo) < 
m/2 + o{m) for m large, which nearly saturates the 
bound C^{U) > C^{U)/2 that we now understand for 
gates satisfying U = W . 

An alternate proof of the reversal theorem can be 
obtained from the resource equality 2[q ^ qq] ^'l^?" [g 
q] + [qq]. Theorem [3] is then equivalent to the claim 
that {Q,,Q2,E) g clQQE([/) ^ {Q2,Qi,-E) e 
clQQE([/^). See iflTl for similar examples of reversing 
quantum communication protocols. 

III. V„i: A GATE WITH ASYMMETRIC CAPACITIES 

Guided by Theorem [3] we will construct a gate that is 
quite different from its inverse. Again choosing a positive 
integer to as a parameter, define Vm on (8) by 



Vm\x)%)'' 



\x)^\y — 1)^ for < y < a; 



ym\x)^\y)^ = \x)'^\y)^ foiy>x 

The first line means that {Vm) > m[q ^ qq]. Though we 
will not need this fact, it turns out that C^{Vm) — to. 
The proof, following a similar argument for Uxoxo in 
Q is as follows. Since Vm = J2x \x){x\i^ for some 
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operators B^, we know that the Schmidt rank of Vm is 
< 2™. Thus 

m < C^iVrn) < E{V^) < logSch(K„) < m 

and each inequality must be an equality. 

To bound the back communication of V^, we now 
claim that V,n can be simulated to within an accuracy of e 
by using mlq qq] and 0{\ogm/e){[q q] + [q ^ q])- 
We would like to say that this implies 

m[q qq] + 0{logm)([q q] + [q ^ q]) 

> (Vm) > m[q ^ qq]., (7) 

but our tools are not strong enough to actually prove this. 
This is because simulating n copies of Vm to constant 
accuracy would require 0{n{m + lognm)) qubits of 
communication, which is superlinear in n for any fixed 
m. However, for now suppose that Eq. (|7]) were true. It 
would imply that 

Ct [Vm) < 0(log m) « m - (8) 

a rather dramatic separation between forward and back- 
ward capacities, even when we allow free entanglement 
to assist the back communication. By using techniques 
specialized to unitary gates, we will give a proof of 
Eq. (|8]l later in this section; the proof is inspired by 
Eq. (|7]l, but of course does not rely on it. 

Our simulation also means that Vm could be thought 
of as almost equivalent, at least for large m, to the 
resource of coherent classical communication. This is 
interesting both because it is more natural to implement 
cobits as a unitary gate than as an isometry and because 
unitary gates, unlike isometrics, are reversible. We will 
return to this second point in the next section when we 
discuss VjI^. However, we cannot state this as a more 
precise statement about asymptotic resources since the 
sequence {Vm)m=i does not fit ||T9l 's definition of an 
asymptotic resource. 

A. A simulation for Vm 

In this section, we show how Vm can be simulated 
up to error e by a protocol that uses m[(7 — > qq] and 
0(log m/e)([g — > q] + [q ^ q])- A key subroutine used 
in the simulation is a classical communication protocol 
for distributed comparison. Suppose x,y G {0,1}™, 
Alice holds x and Bob holds y, which we interpret 
as integers between and 2™ — 1. Then for any error 
probability e > they can probabilistically determine 
whether x = y, x>yoTx<y using 0(logTO/e) 
bits of communication BOl . The comparison protocol is 
designed for classical information, but our simulation of 
Vm will run it coherently using quantum communication. 



For the simulation for Vm, suppose Alice and Bob 
start with \x)^'-\y)'^^ for x,y £ {0,1}™. Our protocol 
is as follows. 

(1) Define the indicator variable w to be 1 if y = 0, 2 
if < y < a; or 3 if y > X. Use 0(logTO/e) qubits 
of communication in either direction to coherently 
compute w. This leaves them (up to error e) with 
the state 

\xf'\yf'\w}^^\wf^\f{x,y))^'''^ 

and where f{x,y) is the state of the ancillas 
produced by the comparison subprotocol. Using 
a standard procedure (compute \w), copy it to a 
new register and then uncompute the first copy of 
\w) along with the ancilla states produced along 
the way), Alice and Bob can eliminate the ancilla 
register to hold simply 

\xf'\yf'\w)^^\wf\ 

again up to error e. 

(2) Use m[q qq] as follows: 

• If w = 1, then Alice inputs which maps 

to the state \x)^''\x)^^. Since y 0, the Bi 
register is in the 10) state. Bob swaps Bi and 
Bs to obtain |x)^'|0)^\ 

. If w = 2 or 3, then Alice inputs |0)'*\ 
which maps to the state |G)'^'^|0)^^. Then she 
discards A3. 

In either case Bob discards register B3, which 
always contains the |0) state. 

(3) If w 2 then Bob maps \y)^' to \y - 

(4) Alice and Bob use an additional 0(logm/e^ qubits 
of communication to uncompute I lu) ^jui) ^.This 
point is slightly subtle, as the meaning of w as 
changed: now w = 1 means that y = x, w = 2 
means that y < x and w ~ 3 means that y > x. 
Thus, Alice and Bob will compute \w')^^\w')^^ 
corresponding to these new cases, and will each 
map \w) \w') to \w © w' (mod 3)) \w'). With prob- 
ability > 1 — 2e we have w = w', so this opera- 
tion effectively erases \w). Then they uncompute 

along with all the ancilla produced 

along the way. 
The entire procedure uses m[q qq] + 

0(logm/e)([g q] + [q ^ <}])■ To see that the 
protocol works, first observe that in an ideal protocol 
where equality testing was perfectly accurate we would 
obtain precisely Vm- Thus when we replace equality 
testing with an approximate version that is accurate 
(in the sense of cb-norm ll34l ) to within e, the overall 
protocol has error < 2e. 
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B. Bounding the backwards capacity of Vm 

We now use our simulation for Vm to prove that 
C^{Vm) < O(logm). Since the cost of our simulation 
depends on the desired error rate, standard resource 
arguments are not enough to prove this claim. Instead, 
let F,^ denote the result of simulating Vm to accuracy e 
using the above procedure. Since V^ is built out of cobits 
and qubits, it will be an isometry rather than a unitary 
operator. This is not a serious problem: Theorem |2] 
still applies and by extending the space that Vm acts 
on, we have \\Vm - K^II°o < where ||X||oo 
™^'^|-0)-(V'|'0)=i "^'<^\-^ Moreover, we can sim- 

ulate V^ exactly using rn[q — > qq\ + 0(logr7i/e)([g 
9] + [9 ^ <?])■ Thus Ct{V^) < 0{\ogm/e). Then we 
will conclude that C^{Vm) < O(logTO) by choosing 
e = 1 /to and applying the following lemma: 

Lemma 1 (Continuity of one-way capacity): If U 
and V are isometrics with outputs in ® such 
that \\U - V^llcb < e then for all {C,E) e clCE(;7) 
there exists {C',E') e clCE(y) such that \C~C'\ < e' 
and \E - E'\ < e', where e' = Selogd + 4i/2(e) and 
H2{e) :=-eloge-(l-e)log(l-e). 

Proof: By Theorem |2] for any 6 > there exists an 
ensemble of bipartite pure states £^^ba b ^^^j^ jj^^j. 

I{X;BB')u(^£)-I{X;BB')£>C^S (9) 
\H{BB'\X)u^£) - H{BB'\X)£ - E\ < S, (10) 

where U acts on the d-dimensional systems A and S, 
while X, A' and i?' can have arbitrarily large dimension. 
Thus we will need to use a recently proved variant 
of Fannes' inequality|2| which bounds the change in 
relative entropy as a function of the dimension of only 
the first system. Begin by using the chain rule lfT4l to 
express H(BB'\X) as H{B\B' X) + H{B'\X). Ref. E) 
states that if \\U{£) - V{£)\\i < e then 

\HiB\B'X)u^s) - HiB\B'X)y^s)\ < 2i/2(e)+4elogd, 

where d — dimi?. On the other hand 

£^'^ = U{£)^'^ = Vi£)^'^, so H(B'\X)u(£) - 
H{B'\X)vi£) and 

\H{BB'\X)u(£) - H{BB'\X)y^£)\ < 2H2{e)+Ae\ogd. 

Similarly I{X;BB') = H{BB') - H{BB'\X) = 
H{B') + H{B\B') - H{BB'\X) and H{B') is un- 
changed by applying a unitary to AB, so 

\I[X■BB')u^^s) - I{X-BB')y(£) \ < AH2{e)+8e\ogd. 

If we now take (5 — > and apply Theorem |2] again to 
relate Ai^e{V) to clCE(y), we obtain the proof of the 
lemma. ■ 



Remark: We suspect that the entire two-way commu- 
nication capacity region CCE(?7) is similarly contin- 
uous. However, without a characterization of the two- 
capacity analogous to Theorem |2] the proof technique 
used in Lemma [U will not work. 

IV. More asymmetry: the capacity region of 

V^ 

^ m 

In this section we will demonstrate nearly tight bounds 
for the capacity region of V^]^, just as we did with Vm- 
We will find large separations between entanglement- 
assisted and -unassisted capacities, as well as between 
entanglement-creation and -destruction capacities. 

First use Theorem [3] to show that (V^) + rn[qq] > 
iTi[qq q], or equivalently, that {VX) > iTi{[qq ^ q] — 
[qq])- Next, we will present an approximate simulation 
for V^jj that uses a nearly optimal amount of communi- 
cation; i.e. barely more than m{[qq ^ q] ~ [qq]). Thus, 
by analogy with the nearly optimal simulation of Vm in 
Eq. (|7|l, we will have 

■m[qq ^ q] + O {log m/e){[q q] + [q ^ q]) 

> {VD + m[qq] > m[qq ^ q]. (11) 

Again, the first inequality is not really a resource in- 
equality, since it doesn't give us a way of having the 
overall error vanish when simulating n copies of V^l^ 
using (1 + o(l))rt times the resource cost. However, it 
will still be enough for us to give us nearly tight bounds 
on the capacity region of V^jj for to large. 

We begin by discussing this capacity region. Choosing 
e = 1/to and again using the Continuity Lemma, we 
can show C^iV^) < 0{logm) < to < C!^{Vi), a 
similar sort of capacity separation between forward and 
backwards communication. However, V^ also exhibits 
a dramatic gap between entanglement-assisted and - 
unassisted capacity. Since mlqq ^ g] + 0{\ogm){[q 
1] + [q ^ q]) can create no more than to + O(logTO) 
ebits, the Continuity Lemma implies that (V^) + m[qq] 
also must have entanglement capacity < TO + 0(logTO). 
Finally we use the fact that the entanglement capacity of 
isometrics is additive (from 17 1371 as well as Theorem|2l) 
to estabHsh that E{VX) < 0{\ogm). 

This implies that all unassisted capacities are small; 
for example C+{Vl) < E{Vl) < 0{\ogm). Thus 
V^ is almost useless without entanglement; none of its 
unassisted capacities are greater than O(logTO). On the 
other hand, its capacity (from Bob to Alice) rises when 
entanglement is supplied, at a rate of nearly one cbit 
per ebit. No such behavior is known for noisy quan- 
tum channels, although there are qudit channels with 
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0{d) multiplicative separations between entanglement- 
assisted and unassisted capacitiesll9l [TOl l32]Fi We also 
obtain a separation between entanglement-creating and 
-destroying capabilities: E{V^^) < O(logm) <^ m — 
E{Vm)- (An independently derived, and completely dif- 
ferent, example of a gate with E{U) ^ E{W) is in 

There are two ways we can derive the simulation of 
1^ posited in Eq. ( fTTT i. The simplest is to time-reverse 
our simulation of Vm- First we will need to replace the 
m[q qq] with ^ + ill])- Reversing this 

will replace [q q] with [q ^ q] and [qq] with — 
Together this is f{[q ^ q]- [qq]) = m{[qq ^ q]- [qq]), 
plus of course the 0(logm/e) terms. 

It may be instructive to also consider a more explicit 
construction of the simulation. Note that V^]^ acts on 
basis states [x)^[y)^ as follows: 

Vl[x)^[x)'' = |x)^|0)^ Vx 
Vl\x)\)'' - forO<y<a; 

KLk)^|y)'^ = \x)^\y)^ fovy>x 

Again Alice and Bob can determine whether y < x, 
y — X or y > X to accuracy e by exchanging 0(logm/e) 
qubits, and by performing these calculations coherently, 
can uncompute this information at the end of the pro- 
tocol. As with the V,n simulation, the three cases at the 
end of the protocol are different (0 < y < x, y — 
and y > x), so it is important that they store no more 
information than which case holds. 

The interesting case is when x — y and Alice 
and Bob would like to map |x)"^|0)^ 
for arbitrary values of x. As we have argued above, 
this can be simulated by reversing ni{[q — > qq]) = 
— > q] + [99])' which requires a resource cost 
of mi[qq ^ q] - [qq]) - fi[q ^ q] ~ [qq]). Let 
US now examine this reverse procedure in more detail. 
For simplicity, suppose m = 2. The procedure we 
would like to reverse is coherent super-dense coding 
{[q^ qj + [qqj^>2[q^^ qq^, which maps [xi) ^[x2) ^ 
to [xi) ^[x2) ^\x2) ^ as follows: First Alice and 

Bob add a maximally entangled state 1$)"*^^^ Then 

^Every example of such a channel has capacity much smaller than 
logd; e.g. the channel that maps p to Af{p) = ep+ (1 — e)I/d. For 
concreteness, follow (32| and choose t = —l/{d? — 1), so Afip) = 
(dl - p)/(d2 _ 1). Then the single-shot HSW capacity Cf^'CAT) is 
Q{d~^) and the entanglement-assisted capacity Ce{J^) is S{d~^). 
Achieving this entanglement-assisted rate using the protocols of [10 
[32 1 requires log d ebits per use of J\f, meaning that we consume many 
ebits to get a small enhancement in classical capacity. Communication 
protocols which used less entanglement were given in 1 19 44|, but 
here too we conjecture that f2(log ci) ebits are necessary to raise the 
capacity of Af from 0(d^^) to Q(d~'^), or even to Q{d^^+^) for 
any 5 > 0. 



AHce applies the Pauli operator X^^Z^'^ to the A3 
system, leaving the state 

[xi)'^'[x2)'^'{X^'Z''' (g) I)[<i>)^'^' =: 

|xi,X2)-'^^1<i>.,..,)^^^\ (12) 

where [<^>x,.x2) (X^^Z^^ (g) /)|$). Note that the four 
I $2;) form an orthonormal basis, and thus we can define 
the unitary map Usd E^i^oEx^^o Ni, 2:2) (^'xi.x^ I- 
Super-dense coding proceeds by Alice sending her half 
of \^xi,x2) to Bob, who applies C/sd to yield the state 

[xi,X2)^\xi,X2)^ ■ 

Now we ex|)lain how this protocol can be reversed to 

map [xi,X2) 1x1,2:2) to [xi,X2) ■ Bob first applies 
C/gp to [xi,X2)^ and obtains [^x)^^^^- Using [q ^ 
q\. Bob sends half of [^x] to Alice, so the joint state 
becomes [xi^X2)^[^x-i,x2)^^ ■ Since Alice has a copy of 
X, she can apply [X""^ Z""^)"^ to her half of [^xuX2)^^ 
and transform it to l^o.o) = |^)- Thus, not only has 
Bob's copy of 1x1,2:2) been erased, but Alice and Bob 
are left sharing the state |<i>). This means that two bits 
on Bob's side can be coherently erased by using the 
resource [q^q]- [qq] = 2{[qq ^ q] - [qq]). 

V. Coherent erasure and time-reversal 

By now we have seen many examples of how unitary 
communication protocols can be reversed to give new 
protocols. In this section we summarize these reversal 
rules and, inspired by our observations about in the 
last section, introduce the new communication resource 
of "coherent erasure," which is the time-reversal of 
coherent classical communication. Time-reversal sym- 
metry was also discussed in |17|, which explained how 
the quantum reverse Shannon theorem|6| is the time- 
reversal of the quantum Slepian-Wolf theorem (a.k.a. 
state merging|33| or the mother lll8lll9i ), once all the 
protocols are made fully coherent. 

For the standard resources of [q q\ and [qq\, time- 
reversal is quite simple. If we let denote the time- 
reverse of a resource, then [q q]'^ = [q ^ q] 
and [5(7]^ = ^[99]- This means that sending qubits in 
one direction is reversed by sending qubits in the other 
direction, and that creating entanglement is reversed 
by destroying entanglement. As we have argued above, 
[q qq\ = {[q ^ q\ + [qq\)/2 so the time-reversal 
of [q qq] is [q qq]"^ = {[q ^ 9] - [qq])/2. This 
resource corresponds to the map [x] [x) — > |x) , so 
we call it "coherent erasure" and label it [q <— qq\. 
Since one bit of coherent classical communication is a 
cobit, we (following a suggestion of Charlie Bennett's) 
call [q ^ qq\ a co-cobit from Bob to Alice, where the 
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first "co" stands for "complementary" and the second 
"co" stands for "coherent." The co-cobit from Alice to 
Bob is denoted — > q] and corresponds to the map 

Of course the map I x) \x) is not defined for 

all inputs (what if Alice and Bob don't input the same 
state?), but in this section we will explain how coherent 
erasure nevertheless makes sense as a communication 
resource. First in Sec. IV- Al we will explain how coherent 
erasure can be derived from other resources and then in 
Sec. IV-Bl we will describe some uses of coherent erasure. 

We stress at the outset that coherent erasure is equiv- 
alent to standard resources, and there's no need to 
introduce new concepts such as "erasure capacity" and 
the like. However, it may prove a useful metaphor in 
analyzing other communication protocols. 

A. Producing coherent erasure 

We have akeady seen three ways of producing coher- 
ent erasure, which we briefly review here. 

1} Reversing clean protocols with [q — > qq]: If a clean 
resource inequality involves \q qq\, then in the time- 
reversed version of the resource inequality [q qq\ 
is replaced with [q qq]'^ = [q ^ qq]- This was 
an implicit part of the argument of Theorem |3] which 

clean 

was based on reversing ([/) > C[q qq] to obtain 

c/ean 

m > C[q^qq]. 

2) Reversing super-dense coding: This was explained 
in Sec. |IV] and is basically a special case of the last 
point: time-reversing [q q] + [qq] > 2[q ^ qq] yields 

[q ^ q] - [qq] > 2[q ^ qq]. (13) 

3) The gate V^l^: Just as Vm is equivalent to m cobits 
up to small errors and inefficiencies, V^l^ is roughly 
equivalent to m co-cobits. Of course is not 
a proper asymptotic resource, but it is still useful as a 
concrete way to imagine implementing coherent erasure. 

B. Using coherent erasure 

1 ) Entanglement-assisted communication: Theorem[3] 
(or more precisely, the protocol sketched in Sec. 
for entanglement-assisted back communication using 
t^xoxo) explained how 

[q qq] + [qq] > [qq ^ q] (14) 

by giving an explicit protocol. Another way to derive this 
resource inequality is reversing coherent teleportation 

2[g qq] >[q^q] + [qq] 



to obtain 

2[q ^ qq] > [q ^ q] - [qq]- 

Substituting [q ^ q] = 2[qq ^ q] — [qq] then gives the 
desired result. 

Interestingly, coherent erasure is capable of no com- 
munication on its own, but can convert one ebit into 
one cobit. This is a sharper version of the separation 
we observed between the entanglement-assisted and - 
unassisted capacities of V^- 

2) State merging and partial quantum communica- 
tion: First note that Eqns. (fTSl l and (fT4l i can be combined 
to obtain the equality 

[q ^ qq] = [qq ^ q]- [qq] 

^ [q^ q]- [qq] (^5-, 

= [9^9]- [qq ^ q]- 

This last point means that [q ^ q] = [q ^ qq] + [qq 
q], so the task of sending a qubit can be split into the 
tasks of sending one cobit and coherently erasing one bit. 
There is a direct protocol which performs this. Suppose 
Alice would like to send the state ax[x)^ to Bob. If 
she applies [q qq] then they will obtain the entangled 
state ^.j.a:x[x)^[x)^ - Finally, applying [qq q] will 
erase Alice's state and leave Bob with J^x '^x[x)^ - 

Of course this also works for coherent superpositions 
of messages. If Alice would like to send her half of 
J2x Oix[x)^[x)^ to Bob then she can first apply [q qq] 
to obtain 

^ax[x)"[x)^[x)^ , 

X 

and then [qq q\ will again erase Alice's state to leave 
Y.X ax[x)'^[x)^- 

The general problem here is state merging^^, in 
which Alice gives Bob her piece of a tripartite state 
[ip)^^^, perhaps generating or consuming entanglement 
in the process. Ref. ||T7| argued that cobits are the 
canonical example of feedback channels, which are 
isometries that map from A to AB. Likewise, we claim 
that coherent erasure is the canonical example of state 
merging. 

To justify this interpretation, we will now show how 
to generalize the decomposition [q ^ q\ = [q ^ 
qq] + [qq ^ 9] to a decomposition of perfect quan- 
tum communication from A ^ B into an isometry 
from A AB followed by merging AB into B. 
The isometry from A AB can be simulated by 
I{R;B)[q qq] + I{B)A)[qq] using the quantum re- 
verse Shannon theorem||6l ll7l , where the coherent infor- 
mation I{B)A) is defined as I{B)A) := -H{B[A) = 
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H{A)-H{AB) = H{A)-H{R). The I{R;B)[q qq] 
cost represents the difficulty of creating the desired 
correlations between Bob and the reference system, 
while the I{B)A)[qq] cost is necessary because back 
communication would allow Bob to distill that much 
entanglement with Alice while preserving his correla- 
tions with R. Then the tripartite state can be mapped 
(using state merging) to one where Bob holds Alice's 
part using I{R;A)[qq ^ q] - I{A)B)[qq]. Here the 
erasure cost measures the amount of correlation with 
the reference system that Alice has and needs to give 
up, while I{A)B) is the amount of entanglement that 
is recovered by the procedure once Bob has the entire 
purification of Alice's state. Indeed, this version of state 
merging amounts to a coherent version of entanglement 
distillation {I[A-E)[c c] + > I{A)B)[qq]) 

in which [c c] is replaced with [qq q] and as 
a result Bob holds the purification of the environment 
at the end of the protocol. Finally, the total resource 
cost I{R\B)[q ^ qq] + I{B)A)[qq] + I{R;A)[qq -> 
q] — I{A)B)[qq] is simply equal to H{R)[q q], the 
cost of sending the reference system directly to Bob. 

3) Rule I: Coherently decoupled input chits: Suppose 
a + C[c ^ c] > /3 is a resource inequality in which 
the classical message sent is nearly independent of all 
residual quantum systems, including the environment. 
In this case we say that the input cbits are coherent 
decoupled and "Rule I" of 1 18 19 23 1 proves that they 
can be replaced by C{[q qq] — [qq])- Equivalently, 
we can replace them by C[qq q]. Thus, coherent 
erasure can be used whenever we need to send a classical 
message whose contents can be guaranteed to be almost 
completely independent of the remaining quantum sys- 
tems. 

The simplest example of a coherently decoupled input 
is of course teleportation, which quickly leads us to 
the familiar resource inequality 2[qq ^ q] + [qq] > 
[q ^ q]. A slightly more nontrivial example is remote 
state preparation! 8 1, in which logd ebits and logn 
\ogd + log log d + 21ogl/e + 0(1) cbits are used for 
Alice to prepare an arbitrary d-dimensional state in 
Bob's lab (i.e. \ogd "remote qubits"). Since the input 
cbits are coherently decoupled, they can be replaced 
by (iog n){[q qq] — [qq]). Asymptotically this means 
that one coherent bit is at least as strong as one remote 
qubit, which is an interesting statement because cobits 
and remote qubits both lie somewhere in between cbits 
and qubits, and there does not appear to be any trivial 
protocol relating the two. Another way to interpret 
remote state preparation is that [qq q] + [qq] yield 
one remote qubit; this, by contrast, can be implemented 



by a relatively straightforward single-shot protocol. Sup- 
pose the state that Alice wishes to prepare for Bob is 
]a) — A particularly easy case is when 

= for all x. Then Alice could locally map 
l*d> — ■^T,t=i\^)^\x)^ to T,i=i(^x\x)^]x)^ and 
then use log d[qq q] to leave Bob with the state 
J2t=i0^x\x)^. 

In general, |q!j;| will not always be equal to 1/Vd, and 
blithely applying the above method for a general state 
will only achieve a fidelity of F{a) :— lo^ajl/v^- 
Moreover, even a randomly chosen state will usually 
have F close to \/tt/2 when d is large (as can be seen 
using 'E]ax] = y^n/Ad and the central limit theorem). 
This means that multiplying |a) by a fixed random uni- 
tary will not be sufficient to obtain high fidelity. Instead, 
we will use a small sequence of unitaries {Ui, . . . ,1/^}, 
along with a K-dimensional ancilla register that controls 
which unitary is applied. To decode. Bob will need this 
ancilla register, which we will require log k extra qubits 
of communication from Alice to Bob. Fortunately, we 
will see that the error shrinks rapidly with k for a variety 
of choices of {Ui, . . . , [/«}. 

The procedure for Alice to remotely prepare |a) in 
Bob's lab is as follows. Let 

]P) ■.^-j2m,]a) = Y,p,]b,)]x), 

k=l x=l 

where in the last expression we have introduced states 
]bx) for each x. Assume for now that F{f3) = 
is close to one. Then, starting with the 
shared state ]^d), Alice can create 

If she then applies \ogd co-cobits to AB and sends A' 
to Bob using log k qubits then Bob will have a fidelity- 
F{(3) approximation to from which he can obtain 
|a). 

We can summarize the correctness of the protocol as 
follows: 

Theorem 4: There exists a subspace V C with 
d' :— dimV — (ie^/32 log(5/e) such that the remote 
state preparation protocol above can prepare any state 
]a) e in Bob's lab using \ogd[qq q] + logd[qq] + 
log K[q — > q] and with fidelity 1 — — 2e. 

This translates into remotely preparing a state of 
logd' = lege? — 2 log 1/e — 0(loglog 1/e) qubits, which 
is similar to the performance of [8]. 

Proof: To analyze the protocol, we first seek to 
lower bound E,aF{P), where a is uniformly randomly 
chosen. We will choose Uk to be \^ + f^){x\ for ^ ^ 
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{1, . . . , k} and where addition is mod d. We could also 
choose the Uk to be (approximately) mutually unbiased 
bases (meaning that |(a;|J7^C/fc' |a;)p is (approximately) 
equal to 1/ d), but we omit the analysis here. To evaluate 
the expected fidelity, we use linearity of expectation: 



a 

1 ^ 



x=l \ k=l 



(16) 



Using the fact that the distribution of \a) is rotationally 
invariant, we find that 



EaFiP) 



where P is a rank-K projector (assuming that k < d). 
To estimate this last term, we will use the inequality 
E\X\ > {EX^)i/{EX^)i which holds for any random 
variable lITTI . Next, we calculate ETrPa = K/d and 
E(Tr Pa)2 = Tr(P (g) P) • E(a (g) a) = Tr(P (g) P) • 
(/+SWAP)/d(d+l) = K{K+l)/d{d+l). Putting this 
together we find that 



EaP(/3) > 



1 + i 



> 1 - 



1 

2k' 



The remaining steps are quite similar to the argu- 
ments in [ITl l29|. We will argue that not only is F 
close to its expectation for most values of a, but in 
fact if we choose a random subspace V of dimension 
d' = 0(c?e^/32 log(5/e)) then with nonzero probability 
every vector \a) V will have F{f3) > 1 ^ ^ — £• 
Observe that the Lipschitz constant of P(/3) (defined to 
be maxi \ ^^idF / dajf') is constant. In fact, it is 1. 



(17) 



Then Levy's Lemma||35| states that 



Pr 



Zk 



<2exp(- — 



for any e > 0. Now consider a random d' -dimensional 
subspace V. According to |29, Lemma III. 6], we can 
choose a mesh of (S/e)"^' = exp(de^/32) points such 
that any point in V is within e (in trace distance) of 
some point in the mesh. Applying the union bound and 
the triangle inequality to Eq. ( fTTI i. we find that there 
exists a subspace V such that F{a) > 1 — ^ — 2e for 
all |a) eV. " ■ 

Thus coherent erasure is an alternate, and arguably 
more direct, way to think about remote state preparation. 
It is perhaps interesting that our protocol is nontrivially 
different from the comparably efficient protocol that 
could be obtained from applying |[T8][l9]|23rs "Rule I" 



to iU. While Rule I guarantees that in general co-cobits 
can be used in place of coherently decoupled input bits, 
the proof is indirect and involves catalysis. It would 
be interesting to know whether there is a more direct 
and natural use of coherent erasure, such as the one we 
showed for remote state preparation, in any protocol with 
coherently decoupled input cbits. 

VI. Conclusions 

The results of this paper help resolve many ques- 
tions surrounding unitary gate capacities. We now un- 
derstand that, while all the capacities of any nonlocal 
gate are nonzero, there can be asymptotically large 
separations between these capacities. The main sepa- 
ration left unproven by this paper is finding a gate 
with E{U) ^ C+(C/). An early version of this paper 
proposed the gate on g) which exchanges |01) 
and \^d), while leaving the other states unchanged: 
U = /-|01)(01|-|$d)($d| + |01)($d| + |$rf)(01|. Since 
U requires log d cbits to simulate, even using unlimited 
EPR pairs (as we will prove later in this section), the 
simulation techniques in this paper will need to be 
modified. Since the first version of this paper appeared, 
||26| established the conjectured separation (showing that 
Cf{U) < O(loglogd) < logd < E{U)) by using non- 
maximally entangled states for the simulation. 

Another limitation of our work is that the separations 
we have found are between o(logd) and « logd; on the 
other hand, IMI has proven that if E{U) = 2 log d then 
= 2\ogd as well. It would be interesting to see 
which capacity separations are possible for gates with 
E{U) between logd and 2\ogd. 

Of course, separating communication capacities is 
mostly intended as a step towards better understanding 
quantum communication using unitary gates as well as 
other resources. For example, it has led us to the resource 
of coherent erasure, which hopefully will turn out to be a 
useful concept the way coherent classical communication 
has. 

Entanglement destruction is another resource in quan- 
tum information theory that seems to be worth explor- 
ing. Once we restrict protocols to be clean, destroying 
entanglement is a nonlocal task. Equivalently, creating 
coherent superpositions of states with varying amount 
of entanglement requires communication, even if the 
two parties are allowed unlimited numbers of maximally 
entangled states ll30l . This task comes up in entanglement 
dilution li27H30ll . in its generalization, remote preparation 
of known entangled states (though this is not explicitly 
acknowledged in |I8]), and in the quantum reverse Shan- 
non theorem||6|, which may be thought of as a further 
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generalization of remote state preparation. In each case, 
the resource of "entanglement spread" — meaning the 
ability to generate superpositions of states with varying 
amounts of entanglement — appears to be necessary. En- 
tanglement spread can be generated (using e.g. remote 
state preparation) by sending cbits in either direction, 
or even without any communication at all, if Alice and 
Bob can make catalytic use of an embezzling state 1281 . It 
appears that a unitary gate U has "spread capacity" equal 
to E{U) + E{W); for example, a simple application of 
1301 can prove that Ci[c ^ c] + C2[c ^ c] + oo[qq] < 
(U) implies that Ci + C2 > E{U) + E{W), a result 
previously proved only for two-qubit gates, using very 
different arguments fT2\. Of course, without a precise 
definition of entanglement spread it will be difficult to 
formalize these arguments. Ref. lf30l is a promising first 
step towards defining spread as a resource, though the 
unusual scaling of embezzling states and of the cost 
of entanglement dilution suggest that the i.i.d. resource 
model of HHIll] might not fit well. 

VII. Notation 

In this section, we collect some of the notation used 
in the rest of the paper. 

TABLE I 

Definitions of communication resources 



abbr. 


name 


formula 


[gg] 


ebit 


^(|oo>-4^ + |ii)-^«) 


[c^c] 


cbit 




[q^q] 


qubit 


ELo 


[q qq] 


cobit 


ELoi-)^i-)"<-i^ 


[q ^ qq] 


co-cobit 


ELoI-)^<-|-'<-I'' 




unitary 


u 



Cobits were introduced in |l22l| and co-cbits were 
introduced in Sec.[V] Note that co-cobits are only defined 
when Alice and Bob's joint state is constrained to lie in 
the subspace spanned by |00) and 

To understand the relations between the resources in 
Table m we describe the effects of exchanging Alice and 
Bob and of running a protocol backwards. These are 
listed in the "exchange" and "reverse" columns of the 
table below. For example, [q q] means sending a 
qubit from Alice to Bob, so either exchanging Alice and 
Bob or reversing time transforms [q — > q] to [q ^ q]. 
However, given a cobit from Alice to Bob, exchange 
and time-reversal do not act the same way: exchanging 
Alice and Bob yields a cobit from Bob to Alice while 
time-reversal yields a co-cobit from Bob to Alice. 



TABLE II 

Exchange and time-reversal symmetries 



resource 


exchange 


reverse 


[qq] 


[qq] 


-[qq] 


[c^c] 




undefined 


[q^q] 


[q^q] 


[q^q] 


[q qq] 


[qq ^ q] 


[q ^ qq] 


[q ^ qq] 


[qq q] 


[q qq] 


(U) 







In the first line of Table [III we can consider [qq] to 
be the action of creating an ebit. Thus, the time-reversal 
of [qq] corresponds to destroying an ebit, which, if done 
coherently, is a non-trivial resource. In the last line, T 
denotes the unitary operator that exchanges Alice and 
Bob's systems. The time-reversal relations for cobits and 
co-cobits are explained in Sec. |V] 

Next, we summarize some of the basic transformations 
of the resources in Table |T] that are possible. We have 
indicated where the protocols can be done cleanly by 

. c'Sf " clean . . ^ 

usmg > or — mstead of > or 



TABLE III 

Transformations between standard resources 



2[c- 




- [qq_ 


clean 

> [q- 


^q] 


teleportation 


[q- 




- [qq_ 


clean 

> 2[c- 




super-dense coding 


[q- 




- [qq_ 


clean ^ r 

= 2[g- 


qq] 


from (22) 


[q- 




- [qq_ 


clean ^ r 

= 2[g < 


- qq] 


from Sec. |V] 


[q qq] - 


Hq- 


- qq_ 


clean r 

= [q- 


>q] 


from last two lines 



Now define [/ to be a bipartite unitary gate. We have 
defined various capacity regions in Sec. U which are 
summarized in Table |IV] below. 

Finally, we will explain how the results of the paper 
relate to the terms in Table |IV] First, some of the 
theorems relate as follows: 

• Theorem [T] shows that the capacity regions CCE 
and clCCE are nearly equivalent, other than the fact 
that entanglement can be thrown away in non-clean 
protocols. As a corollary. Theorem [T] also relates 
CE to clCE the same way. 

• Theorem |2] and Lemma [T] concern the region 
clCE(C/), giving a single-letter formula for it, and 
proving its continuity (in terms of U), respectively. 

• Theorem [3] shows that clCCE([/^) can be com- 
pletely determined from clCCE(L/) (and vice-versa, 
of course). 

Finally, Table FVl summarizes the separations in capacities 
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TABLE IV 

Capacities and capacity regions of unitary gates 



CCE(C/) 


= {(C-i,C2,E) : U > Ci[c^c] +C2IC*- 


d +E\qq]} 


clCCE(C/) 


clean 

= {{Ci,C2,E) : U > Ci[c^c] + C2[c^ 


-c]+Elqq]} 


CE{U) 


= {{C, E):U>C[c~*c]+ Elqq]} 




clCE(C/) 


= {{C,E) : U'''^'>"' C[c ^ c] + E[qq]} 






= max{C : U + oo[ijg] > C[c -* c]} 






= max{C : [/ > C[c ^ c]} 




Ct{U) 


= max{C : U + oo[qq] > C[c ^ c]} 




C^{U) 


= max{C : [/ > C[c ^ c]} 




C+iU) 


= max{Ci + C2 : U > Ci[c -* c] + C2[c 


^c]} 


C^iU) 


= max{Ci+C2 : U + oo[qq] > Ci[c-^c] - 


t- Calcic]} 


E{U) 


= max{£; : U > E[qq]} 




proved in 


Sec. Hn] and Sec. |IV] 





TABLE V 

Exchange and time-reversal symmetries 



f. vs. b. comm. 




ent. create/destroy 


E{vl) < 


ent.-assted f. vs. b. comm. 




comm. w/ or w/o ent.-asst. 





Here f. means forward, b. means backwards, comm. means communi- 
cation (which can be equivalently taken to be cbits, cobits or qubits), 
ent. means entanglement, and asst/assted means assistance/assisted. 
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Appendix 

In this appendix we sketch alternate proofs for Theo- 
rems [U and 121 Both proofs we give are slightly simpler 
than previous versions and have slightly better con- 
vergence properties. Moreover, by including them, this 
paper can be more self-contained, especially given that 
the previous statements of Theorem [T| in II23II25I were 
slightly weaker 

Proof of Theorem Q} We begin by following the 
approach of ||25]| . where this was first proved. Suppose 



([/) > Ci[c ^ c] + C2[c ^ c] + E[qq] for some E>Q 
(similar arguments apply for E < 0). Then, if Alice and 
Bob copy their inputs before sending and refrain from 
performing their final von Neumann measurements, we 
obtain a sequence of protocols Vn such that for all x £ 
{0, 1}^'^"' and y e {0, 1}^2"' (with cf := n(Q -(5„) 
for J = 1,2) 

(18) 

The main difference between Eq. (fTsT l and our goal 
(Eq. dUi) is the presence of the ancilla \ipx.y)^^ with 
its arbitrary depends on x and y rather than a string 
of zeroes that depends only on n. Simply discarding 
\'Px,y)^^ will in general break superpositions between 
different values of x and y. Also the ancilla is not 
guaranteed to fit in o{n) qubits. 

Ref. [|25i made a series of modifications in order to 
obtain a clean protocol. In fact, ll25l obtained a sUghtly 
weaker result than Eq. (Hli, in which Alice and Bob are 
left with an ancilla \Lpn) which depends only on n 
and (it can be shown) can be stored in < nS'^ qubits. 
We call protocols of this form "semi-clean," but when 
working with unitary gates this implies the protocol can 
be made clean at an asymptotically negligible additional 
cost. This is due to |24|, which proved that any nonlocal 
gate U can exactly generate any other fixed gate, such 
as SWAP, with a constant number of applications inter- 
spersed with local unitaries. Thus, 0{n5[-^) applications 
of U can exactly map \if>'^^) to |00)"*". 

We now review the steps of ll25l in obtaining a semi- 
clean protocol before we describe our alternate approach. 
First they used entanglement as a sort of coherent one- 
time-pad, so that the ancillas would become correlated 
with the one-time-pad and not the message. Then a 
classical error-correcting code was applied to reduce the 
errors, and finally entanglement concentration]]?) was 
used on the error-free blocks to recover the entanglement 
used for the one-time-pad. We use an approach that 
is only slightly different: first using block coding to 
reduce the error to a nearly exponentially small amount, 
then using a coherent one-time-pad to decouple the 
ancillas and finally recovering entanglement using an 
approximate form of entanglement concentration that 
needs far fewer states. 

We now explain these components in more detail. 
First, absorb all of the output into the ancilla, except 
for the locally copied inputs, so that 

Vn\x)%f = |x)^^|j;>^^|^.,,)^^^-^^^^% (19) 
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with Tr(|zj) {y\^' ® \x) {xl""' ® I^''''W.,y > 1 - and 
with entanglement E{\ipx,y)) := H{^^^y^^) > n{E - 
(5„), for an appropriate redefinition of (5„. This is clearly 
an equivalent formulation, but it allows us to speak more 
easily about the exact output of the protocol. 

Now we will use classical bidirectional block- 
coding|43| to control how quickly e„ vanishes as a 
function of n. By applying Vm "2 times and slightly 
reducing the rate, it is possible to send nin2{Ci—&) cbits 
forward and 774^2 ((^2 — 5) cbits backwards with nin2 
uses of U and average error < exp(ri2(l + Q!loge„)) as 
long as (5 > 5m + a + i?2(a)/(?ii(min(Ci, C2) - 5„i)). 
This can be simplified by choosing a — 2/log(l/e„), 
so the error probability is < exp(— 712) and we still have 

(5^0. Finally, for any < 7 < 1, choose n2 — n^'' , 
so if n := 7^lr^2, then 712 = 77^^''. Thus, we can without 
loss of generality assume that we have a sequence of 
protocols {Vn)'^=i with inefficiency (5„ and error e„ 
satisfying lim„_^oo &n = and e„ < cxp(— 77^^''') 
(in fact, we could choose e„ < exp(— /(77)) for any 
f{n) = o{n) that we like)0 Now we see the reason for 
using the form of Eq. ( fT9l ); our block codes still satisfy 
Eq. (fT9] l. but not necessarily Eq. ( fTSl l. since there may 
be arbitrary errors in the entangled states. 

Next, use a coherent one-time-pad in the same way 



as 



. Alice and Bob start with c!"^ + Ck"' ebits. 

A B 

Together with their initial messages I x) \y) , their state 
is 



^ aeAbeB 



where A {0, , B := {0, l}^^ , and N := 

\A\ ■ \B\ = exp(C^"^ +C^"'')- This can be locally mapped 
to 1 

aeAbeB 

Relabelling the sum over a, b shows that this is equiva- 
lent to 

^ ^^\a®x,b®y,a)'^\a®x,b®y,b)^. 

aeAbeB 

Now Vn is applied to |a) \b) , obtaining 



^ '■^ aeAbeB 



y, a)'^\a ®x,b®y, b)^\ipa,b)- 



*This step closely resembles the code construction in 1251 , which 
contains a slightly more detailed proof. The basic tools for the 
proof can also be found in 1 14 43 1; note that they (esp. t43\) only 
bound the average decoding error, rather than the maximum eiTor 
(similarly in 1 12 1, which introduced the idea of double-blocking unitary 
communication protocols). Average error can easily be turned into 
maximum error] 20], though in our case it is not necessary, or rather, 
our entire protocol can be thought of as a coherent version of j 20 1 . 



Since Tr(|6)(6r ® |a)(a|^ ® I''^)ipa,b > 1 - £«, we 

A B 

can extract \b) \a) from |<y5a,b) while causing 0{^/e^) 
disturbance. This yields a state within 0{^/t^) of 



1 



\a®x,b(B 7/,a,6)'^|a© x,b®y, b, a)^\(pa^b), 

^ aeA 
beB 

which can be locally mapped to 

\x,y)^\x,y)^-^ \a,b)^\a,b)^\ipa.,,) 

aeAbeB 



B, -\AB 



= : \x,y) \x,y) \ip) 

Thus, we have performed the desired coherent commu- 
nication (up to error O(Ve^) = 0(exp(— 77^^'''/2))) and 

AB 

converted 77(Ci + C2 — 2(5„) ebits into , which is 

the fixed pure state 

^ E El"'^^'^|"'^^'^|'^°■f'^■ 
^^^ aeAbeB 



yAB 



The final step is to recover entanglement from | ip) ' 
Since E{ifa.b) > ni^E ~ (5„), we can bound Eq :— 
E{lp) > n{Ci + C2+E - 3(5„). Also, |(^)^^ can be 
obtained from 0{n)[qq\ + 0{n){U), so it must have 
Schmidt rank < exp(0(77)). We would like to repeat 
the entire protocol k times in parallel, and then apply 
entanglement concentration to jcp)®*^ in order to recover 
standard EPR pairs. This will inevitably increase our 
error; we get up to fc • 0{y^) from repeating the pro- 
tocol, and some additional errors from the entanglement 
concentration. However, as long as our final error e' is 
0(1) then we can use the catalytic entanglement safely; 
we merely repeat the protocol [1/e'] times for a total 
error of Ve' and an additional fractional inefficiency of 
\/e', both of which are still o(l). 

Unfortunately, the original entanglement concentration 
protocol of |5 1 will not suffice for this purpose (without 
using a more elaborate procedure, as in ll^Sl ). This is 
because [5] requires fc/logfc to grow faster than the 
Schmidt rank of meaning that k = exp(ri(77)). Since 
e„ ~ exp(— 0(77)), we would be unable to guarantee that 
ken = 0(1). 

Instead, we will use an approximate version of en- 
tanglement concentration (due to Andreas Winter|47|) 
which only requires k — poly log(Sch((^)) = poly(77) 
to achieve vanishing error and inefficiency. Since 
poly(7i)e„ — 0(1), this will complete our proof. ■ 

It now remains only to describe Winter's new version 
of entanglement concentration[47 |. Since this result may 
be more broadly useful, we rename the variables from 
the proof of Theorem [T] to more conventional notation. 
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and state a slightly more general result than we need 
above. 

Theorem 5 ( due to A. Winter): Let 

, . . . ,\ipn)^'^ be bipartite states each with 
Schmidt rank < d, and with total entanglement 
E :== J27=iH{^f). Then for any 5 > such that 
n > max((5~^3(log(i)^, 20(5^^ log 71^2' Alice and Bob 
can extract E — n5 ebits from It/ji) (g) . . . (g) 
with error 0(exp(— (n5^)^/^/21n2)) using no 
communication. Up to the above error, their residual 
state has Schmidt rank < cxp(2n(5). 
In particular, E{1 — o(l)) ebits can be extracted with 
error o(l) while creating a sublinear-size garbage state 
if n = poly(log(i). This suffices to complete the above 
proof of Theorem [T] 

Proof of Theorem^ First we describe the protocol, 
then analyze its correctness. Alice and Bob begin by 
using local unitaries to rotate the Schmidt basis of each 

into a standard basis. This leaves them with the state 

where {p\, . . . are the Schmidt coefficients of 
(possibly not all nonzero). 

Next they each project onto the subspace where the 
Schmidt coefficients are in the range exp(— i?± nS/2); 
that is spanned by . . . ,jn) ^ bi, ■ • ■ ,jn) for those 
values of ji, . . . , j„ satisfying | logp}, +£^1 < n5/2. 
We will later argue that this projection almost always 
succeeds, and thus causes very little disturbance. 

Now they divide the interval [cxp(— — 
n6/2), exp{—E + n6/2)] into m bins with geometrically 
spaced boundaries, for m a parameter we will pick 
later. That is, for k — 1, . . . , m, bin k is the interval 

/ i ~ 1 f ^T-^ 3 c-\ 

exp[-E \-- ,exp h— n(5 . 

\ 2m / \ 2m/ 

Still without using any communication, Alice and Bob 
each perform a projective measurement onto the different 
bins. By this we mean that the measurement opera- 
tors are projectors onto subspaces spanned by strings 
. . . ,jn) whose Schmidt coefficients fall entirely into 
one of the above intervals. We claim that (a) that with 
high probability they will find a bin that contains many 
eigenstates, and (b) the resulting state will have high 
fidelity with a maximally entangled state. It remains only 
to quantify the various errors and inefficiencies we have 
encountered along the way. 

Start with the last step. The probability that the bin 
measurement yields a bin with weight < e (for e a param- 
eter we will set later) is < me. Each Schmidt coefficient 



that we have kept is < exp{—E + n5/2), and thus any 
bin with weight > e must contain > ecxp{E — nS/2) 
eigenvalues. Choosing e = 2^"'^/^, we obtain a state that 
approximates > log l/e + E—n6/2 = E — n6 EPR pairs. 
To assess the fidelity of this approximation, note that all 
of the Schmidt coefficients of the projected and rescaled 
state are in a band between A and \exp{~n5/m) > 
A(l — n(5(ln2)/m), for some normalization factor A. 
Thus this state has fidelity 1 — 0{n6/m) with a maxi- 
mally entangled state. Now choose m — 2"'^/'*, so that 
both the me failure probability and the 0{n5/m) error 
are < exp(— n5/5). 

All that remains for the error analysis is to assess the 
damage from projecting onto the Schmidt coefficients 
in the interval —E ± n5/2. The key tool here is the 
Chernoff bound lfT3l . which states that if Xi,...,X„ 
are independent (not necessarily independent) random 
variables satisfying < < 7 then X := X^ILi 
satisfies 

p(|X-EX|>4)<2exp(--^). (20) 

We would like to apply this with Xi defined by P{Xi = 
logl/pp = pj, so that X is likely to be close to 
EX = E. Unfortunately, can be arbitrarily close to 
zero, and thus log 1 /p* can be arbitrarily large, so we 
cannot immediately establish any upper bound on To 
do so, we will discard the Schmidt coefficients that are 
smaller than 2^^ , which automatically means that < 
Xi < 7. This causes <d2 ' damage for each or 
< nd2~'' overall. Combining with Eq. (|20] i. we find the 

total error is mi2^^+e . Finally we set 7 = {nS'^)^'^ 
to obtain an overall error of 0(exp(— (nJ^)^/'^/2 ln2)), 
which dominates the exp(— n(5/5) error from the first 
part. 

Now we explain how the protocol can be made semi- 
clean. The "which bin" measurement should instead be 
performed coherently, with the entanglement extraction 
proceeding conditioned on the quantum register storing 
the superposition of measurement outcomes. Since the 
different outcomes remain locally orthogonal, the overall 
Schmidt rank is equal to the sum over k of the rank 
of the state conditioned on obtaining bin k. After the 
initial projection onto the typical subspace succeeds, 
each Schmidt coefficient is > exp(— — n6/2), and 
so each bin has rank < exp(i<^ -I- n6/2). Since we are 

'The Chebyshev bound would avoid these difficuhies, but at the cost 
of losing the exponential bounds on error probability. Nevertheless, 
for small values of n, it may be preferable. Here we can use the fact 
that Var(X) < ralog^d to find that P{\X - EX\ > nS/2) < 
4{log d)^ / nS'^ . Other than the revised eiTor bound, the rest of the 
proof would be the same. 
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extracting E — nS ebits from each bin, the rank of the 
residual state, conditioned on k, should be < exp (^^n6). 
Once we sum over m = 2"^^^ bins, we have an overall 
Schmidt rank of < exp(|n(5) < exp(2n5). ■ 
Proof of coding theorem for Theorem ^ Suppose 
there exists an ensemble £-^^^ = J2xPx\^)(-'^\^ ® 
ipx^'^^' such that 



I{X;BB'] 



U{£) 



I{X;BB')i 



C 



and H{BB'\X)ui£) - H{BB'\X)e - E, 



(21) 



The idea is to use HSW coding 131] gU for U{£)^^' . 
For a string x = (xi, . . . , Xn), let l-^x) := 
\4>xi) ® •■■|V'x„)- The HSW theorem states that for 
e, (5 > and for n sufficiently large, choosing 
N := exp{Ti{I{X; BB')u(^£'f — S)) random codewords 
[/^"(V'jJ'^^', . . . , J7^"(V'x«)'^^' according to the dis- 
tribution p{x) := PxiPx2 ■ ■ -Pxn will result in a code 
with average error < e. 

On the other hand, the operator Chernoff bound[l46l 
states that a collection of M exp(n(/(X; BB')£+S)) 
random codewords , ■ ■ ■ , i^xM will have average state 
on Bob's side quite close to their expectation 



(gin 



BB' 



(22) 



yABB' 



Choose an arbitrary purification \9) . Also let N = 
LM, so i = exp(n(/(X; BB')u(^£) - I{X; BB')^ - 
2(5)) — exp(n(C — 25)). Our strategy for the rest of the 
proof is for Alice and Bob to start with a state where 
Bob's part always looks like 6^^ , but Alice can reliably 
send one of L different messages by performing a local 
unitary and then applying U to the joint state. 

With this in mind, we now rephrase the random 
codes described above. Draw {xij} 



i.jh<£[L],je[AI] 



from the 



distribution p, and let be the probability of error 
when Bob attempts to decode C/®" | tA^. ^. ) . The HSW 
theorem states that with high probability the average 
error is low, i.e. 



L M 



i=i j=i 



< €. 



(23) 



On the other hand, the operator Chernoff bound states 
that with high probability 



1 



i=i 



(24) 



for all i e \L\ (the reason to demand error e^/2 will later 
be apparent). Using the union bound (see e.g. the proof 
of Theorem 1 of |fT6l for detailed calculations), one can 



show that in fact with high probability both Eqns. 
and (l24l i hold simultaneously, and in particular that there 
exists a set of {xij} for which this is true. Fix this set 
for the rest of the proof. 

For each i, let := e^j/M and define the set of 
good codewords to be G = {i : < 2e}. By Markov's 
inequahty, |G| > L/2@ The communication protocol 
proceeds as follows: 

(1) Alice and Bob start with the state . 

(2) To send the message i E G, Alice will perform a 
local unitary operation so that the overall state is 
within e of 



M 



, AA'BB' 



This is possible because of Eq. (l24l l. Uhlmann's 
theorem|36|, and the fact that two mixed states 
with trace distance < /2 have purifications with 
trace distance < e 119, Lemma 2.2]. 
(3) Apply ([/-^-B-jigin jjj^j jjjg parties shai-e a 

state within e of 



1 



M 



M 



AA'BB' 



(4) Bob decodes coherently, to obtain a state within 

e + Bi < 3e of 



1 



'M 



M 

^^^^AB^^^^^AB^^AB^ 



AA'BB' 



(5) 



Conditioned on Alice and Bob concen- 

trate w nH{BB'\X)jj(£) ebits from {U^^ ® 

I)®''^\^]xi j)^^ ■ Since the dimension of the 
states is fixed and n can be made arbitrarily large, 
the entanglement concentration technique of l5l 
will suffice. Moreover, entanglement concentration 
can be performed cleanly, so a sublinear amount of 
additional communication will leave them with the 
state M-^/^Y.]li\j)%f^ which is of course 
equivalent to n{I{X;BB')£ + 6) ebits. 

Alice and Bob have used U n + o{n) times and sent 
n{C — 2(5) cobits. They started with the state 
which can be prepared with entanglement di- 
lution using nH{BB')£ + o{n) ebits and o{n) cbits ll39]| . 
and end with n{I{X;BB')£ + H{BB'\X)u(£)) ± 
o{n) ebits, for a net change of n{H{BB'\X)ij(^£) — 
H{BB'\X)£) ± o{n) = n{E ± o(l)), as desired. ■ 

^Note that unlike in standard HSW coding, we cannot simply throw 
out the worst half of all codewords, since then the Xi^j would nol 
onger be independent and Eq. )24t would no longer necessarily hold. 



logL = 

\n\ABB 
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Note that unlike the proof of Theorem[T](or indeed the 
proof in 122)1 of the present result), no double-blocking is 
necessary here, except perhaps to deal with the sublinear 
communication used for entanglement dilution and to 
erase the ancilla states left by entanglement concentra- 
tion. 
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